Retrospective analysis of haplotype-based case control studies under a flexible model for gene environment association.

نویسندگان

  • Yi-Hau Chen
  • Nilanjan Chatterjee
  • Raymond J Carroll
چکیده

Genetic epidemiologic studies often involve investigation of the association of a disease with a genomic region in terms of the underlying haplotypes, that is the combination of alleles at multiple loci along homologous chromosomes. In this article, we consider the problem of estimating haplotype-environment interactions from case-control studies when some of the environmental exposures themselves may be influenced by genetic susceptibility. We specify the distribution of the diplotypes (haplotype pair) given environmental exposures for the underlying population based on a novel semiparametric model that allows haplotypes to be potentially related with environmental exposures, while allowing the marginal distribution of the diplotypes to maintain certain population genetics constraints such as Hardy-Weinberg equilibrium. The marginal distribution of the environmental exposures is allowed to remain completely nonparametric. We develop a semiparametric estimating equation methodology and related asymptotic theory for estimation of the disease odds ratios associated with the haplotypes, environmental exposures, and their interactions, parameters that characterize haplotype-environment associations and the marginal haplotype frequencies. The problem of phase ambiguity of genotype data is handled using a suitable expectation-maximization algorithm. We study the finite-sample performance of the proposed methodology using simulated data. An application of the methodology is illustrated using a case-control study of colorectal adenoma, designed to investigate how the smoking-related risk of colorectal adenoma can be modified by "NAT2," a smoking-metabolism gene that may potentially influence susceptibility to smoking itself.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Case-Control Association Studies: SNPs, Imputation and Haplotypes.

Although prospective logistic regression is the standard method of analysis for case-control data, it has been recently noted that in genetic epidemiologic studies one can use the "retrospective" likelihood to gain major power by incorporating various population genetics model assumptions such as Hardy-Weinberg-Equilibrium (HWE), gene-gene and gene-environment independence. In this article, we ...

متن کامل

Shrinkage Estimators for Robust and Efficient Inference in Haplotype-Based Case-Control Studies.

Case-control association studies often aim to investigate the role of genes and gene-environment interactions in terms of the underlying haplotypes (i.e., the combinations of alleles at multiple genetic loci along chromosomal regions). The goal of this article is to develop robust but efficient approaches to the estimation of disease odds-ratio parameters associated with haplotypes and haplotyp...

متن کامل

Association of ESRα Gene Pvu II T>C, XbaI A>G and BtgI G>A Polymorphisms with Knee Osteoarthritis Susceptibility: A Systematic Review and Meta-Analysis Based on 22 Case-Control Studies

Background: Many studies have reported the association of estrogen receptor α gene (ESRα) ESRα PvuII T>C, XbaI A>G and BtgI G>A polymorphisms with Knee osteoarthritis (KOA) risk, but the results remained controversial. In order to drive a more precise estimation, the present systematic review and meta-analysis was performed to investigate the association between ESRα polymorphisms and KOA susce...

متن کامل

Marginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data

A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...

متن کامل

Haplotype Effect of Two Human Leukocyte Antigen-G Polymorphisms of rs1736933 and rs2735022 on the Recurrent Pregnancy Loss

Background: Recurrent Pregnancy Loss (RPL) is a multifactorial disease that affects 1-3% of couples. Since Human Leukocyte Antigen-G (HLA-G) gene is involved in fetal maternal immune tolerance, mutations in the HLA-G gene can affect the success rate of pregnancy. Objective: The present study aims to investigate the haplotype effect of rs1736933 and rs2735022 polymorphisms found in the HLA-G ge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Biostatistics

دوره 9 1  شماره 

صفحات  -

تاریخ انتشار 2008